
Conversation

@vjanfaza
Contributor

In these changes, instead of passing the CCL lists during model loading, I added a flag called ccl_enabled that specifies whether the CCL feature is enabled, and moved passing the CCL lists to the compilation process.
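For illustration, a minimal sketch of the new flow (using QEfficient's QEFFAutoModelForCausalLM; apart from ccl_enabled, comp_ctx_lengths_prefill, and comp_ctx_lengths_decode, the parameter names and values below are assumptions and may differ from the final API):

```python
# Minimal sketch: ccl_enabled toggles the feature at load time, while the
# compute-context-length (CCL) lists are passed at compile time.
# Parameter names other than the CCL-related ones are assumptions.
from QEfficient import QEFFAutoModelForCausalLM

ctx_len = 1024

qeff_model = QEFFAutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-30B-A3B-Instruct-2507",
    ccl_enabled=True,  # new flag: only enables/disables CCL during loading
)

qeff_model.compile(
    prefill_seq_len=128,  # assumed value for illustration
    ctx_len=ctx_len,
    num_devices=4,        # assumed value for illustration
    comp_ctx_lengths_prefill=[256, 512, ctx_len],  # CCL lists now passed here
    comp_ctx_lengths_decode=[256, 512, ctx_len],
)
```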

@quic-mamta
Contributor

@vjanfaza, can you please resolve the conflicts on this PR and run the lint/format checks?

@vjanfaza
Contributor Author

vjanfaza commented Nov 20, 2025

@vjanfaza, can you please resolve the conflicts on this PR and run the lint/format checks?

I resolved the conflicts and pushed the changes.

comp_ctx_lengths_prefill = [256, 512, ctx_len]
comp_ctx_lengths_decode = [256, 512, ctx_len]
# In MoE models, when compiling with prefill_seq_len=1 in non-continuous-batching mode, prefill and decode will share the same CCL specializations.
comp_ctx_lengths_prefill = [256, 512, ctx_len] # None #
Contributor

nit: please remove the trailing `# None #` from this line, and from the other places/files as well.
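For reference, a small sketch of the shared-specialization case described in the snippet's comment (variable names follow this snippet; reusing the decode list for prefill is my reading of that comment, not something confirmed elsewhere in the PR):

```python
# MoE model compiled with prefill_seq_len=1 in non-continuous-batching mode:
# prefill and decode share the same CCL specializations, so a single list can
# serve both (my reading of the comment above; not confirmed by the PR).
ctx_len = 1024
comp_ctx_lengths_decode = [256, 512, ctx_len]
comp_ctx_lengths_prefill = comp_ctx_lengths_decode
```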


model_name = "Qwen/Qwen3-30B-A3B-Instruct-2507"
"""
# For CB inference, set continuous_batching to True and add full_batch_size,mxfp6,mint8 argument in compile function
Contributor

nit: this should be `mxint8`, not `mint8`.
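As a hedged illustration of the CB comment above (continuous_batching, full_batch_size, mxfp6_matmul, and mxint8_kv_cache are my guesses at the relevant QEfficient options and may not match the actual argument names):

```python
# Sketch of CB inference per the comment above: enable continuous batching at
# load time, then pass full_batch_size and the mxfp6/mxint8 options to compile().
# Argument names and values here are assumptions and may differ from the real API.
qeff_model = QEFFAutoModelForCausalLM.from_pretrained(
    model_name,
    continuous_batching=True,
    ccl_enabled=True,
)
qeff_model.compile(
    ctx_len=ctx_len,
    full_batch_size=4,      # assumed value for illustration
    mxfp6_matmul=True,      # "mxfp6" in the comment
    mxint8_kv_cache=True,   # "mxint8" (the typo the nit points out)
    comp_ctx_lengths_prefill=comp_ctx_lengths_prefill,
    comp_ctx_lengths_decode=comp_ctx_lengths_decode,
)
```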

comp_ctx_lengths_prefill=comp_ctx_lengths_prefill,
comp_ctx_lengths_decode=comp_ctx_lengths_decode,
)
# mos=1,
Contributor

please remove this line.

processor=processor,
images=image_urls,
generation_len=100,
device_ids=[28, 29, 30, 31],
Contributor

Please make these [0, 1, 2, 3].

inputs["pixel_values"] = inputs["pixel_values"].to(torch.float32)
streamer = TextStreamer(tokenizer)
output = qeff_model.generate(inputs=inputs, device_ids=[0, 1, 2, 3], generation_len=100)
output = qeff_model.generate(inputs=inputs, device_ids=[8, 9, 10, 11], generation_len=100)
Contributor

This should be kept as the original (device_ids=[0, 1, 2, 3]).

vjanfaza force-pushed the CCL-main branch 2 times, most recently from 4461b41 to b8dd26c on December 4, 2025 at 00:54
…ring compilation process

Signed-off-by: Vahid Janfaza <[email protected]>
@quic-hemagnih
Contributor

